Search CORE

18 research outputs found

The distorted mirror of Wikipedia: a quantitative analysis of Wikipedia coverage of academics

Author: Samoilenko Anna
Yasseri Taha
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 10/12/2013
Field of study

Activity of modern scholarship creates online footprints galore. Along with traditional metrics of research quality, such as citation counts, online images of researchers and institutions increasingly matter in evaluating academic impact, decisions about grant allocation, and promotion. We examined 400 biographical Wikipedia articles on academics from four scientific fields to test if being featured in the world's largest online encyclopedia is correlated with higher academic notability (assessed through citation counts). We found no statistically significant correlation between Wikipedia articles metrics (length, number of edits, number of incoming links from other articles, etc.) and academic notability of the mentioned researchers. We also did not find any evidence that the scientists with better WP representation are necessarily more prominent in their fields. In addition, we inspected the Wikipedia coverage of notable scientists sampled from Thomson Reuters list of "highly cited researchers". In each of the examined fields, Wikipedia failed in covering notable scholars properly. Both findings imply that Wikipedia might be producing an inaccurate image of academics on the front end of science. By shedding light on how public perception of academic progress is formed, this study alerts that a subjective element might have been introduced into the hitherto structured system of academic evaluation.Comment: To appear in EPJ Data Science. To have the Additional Files and Datasets e-mail the corresponding autho

arXiv.org e-Print Archive

Springer - Publisher Connector

Analysing Timelines of National Histories across Wikipedia Editions: A Comparative Computational Approach

Author: Lemmerich Florian
Samoilenko Anna
Strohmaier Markus
Weller Katrin
Zens Maria
Publication venue
Publication date: 01/01/2017
Field of study

Portrayals of history are never complete, and each description inherently exhibits a specific viewpoint and emphasis. In this paper, we aim to automatically identify such differences by computing timelines and detecting temporal focal points of written history across languages on Wikipedia. In particular, we study articles related to the history of all UN member states and compare them in 30 language editions. We develop a computational approach that allows to identify focal points quantitatively, and find that Wikipedia narratives about national histories (i) are skewed towards more recent events (recency bias) and (ii) are distributed unevenly across the continents with significant focus on the history of European countries (Eurocentric bias). We also establish that national historical timelines vary across language editions, although average interlingual consensus is rather high. We hope that this paper provides a starting point for a broader computational analysis of written history on Wikipedia and elsewhere

arXiv.org e-Print Archive

Publikationsserver der RWTH Aachen University

Association for the Advancement of Artificial Intelligence: AAAI Publications

Mapping bilateral information interests using the activity of Wikipedia editors

Author: Bohlin Ludvig
Karimi Fariba
Lancichinetti Andrea
Rosvall Martin
Samoilenko Anna
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2015
Field of study

We live in a global village where electronic communication has eliminated the geographical barriers of information exchange. The road is now open to worldwide convergence of information interests, shared values, and understanding. Nevertheless, interests still vary between countries around the world. This raises important questions about what today's world map of in- formation interests actually looks like and what factors cause the barriers of information exchange between countries. To quantitatively construct a world map of information interests, we devise a scalable statistical model that identifies countries with similar information interests and measures the countries' bilateral similarities. From the similarities we connect countries in a global network and find that countries can be mapped into 18 clusters with similar information interests. Through regression we find that language and religion best explain the strength of the bilateral ties and formation of clusters. Our findings provide a quantitative basis for further studies to better understand the complex interplay between shared interests and conflict on a global scale. The methodology can also be extended to track changes over time and capture important trends in global information exchange.Comment: 11 pages, 3 figures in Palgrave Communications 1 (2015

arXiv.org e-Print Archive

Publikationer från Umeå universitet

Digitala Vetenskapliga Arkivet - Academic Archive On-line

SynGraphy: Succinct Summarisation of Large Networks via Small Synthetic Representative Graphs

Author: Kumar Pawan
Kunegis Jérôme
Pirró Giuseppe
Samoilenko Anna
Sun Jun
Publication venue
Publication date: 15/02/2023
Field of study

We describe SynGraphy, a method for visually summarising the structure of large network datasets that works by drawing smaller graphs generated to have similar structural properties to the input graphs. Visualising complex networks is crucial to understand and make sense of networked data and the relationships it represents. Due to the large size of many networks, visualisation is extremely difficult; the simple method of drawing large networks like those of Facebook or Twitter leads to graphics that convey little or no information. While modern graph layout algorithms can scale computationally to large networks, their output tends to a common "hairball" look, which makes it difficult to even distinguish different graphs from each other. Graph sampling and graph coarsening techniques partially address these limitations but they are only able to preserve a subset of the properties of the original graphs. In this paper we take the problem of visualising large graphs from a novel perspective: we leave the original graph's nodes and edges behind, and instead summarise its properties such as the clustering coefficient and bipartivity by generating a completely new graph whose structural properties match that of the original graph. To verify the utility of this approach as compared to other graph visualisation algorithms, we perform an experimental evaluation in which we repeatedly asked experimental subjects (professionals in graph mining and related areas) to determine which of two given graphs has a given structural property and then assess which visualisation algorithm helped in identifying the correct answer. Our summarisation approach SynGraphy compares favourably to other techniques on a variety of networks.Comment: 24 page

arXiv.org e-Print Archive

(Don't) mention the war: A comparison of Wikipedia and Britannica articles on national histories

Author: Génois Mathieu
Jadidi Mohsen
Lemmerich Florian
Samoilenko Anna
Strohmaier Markus
Zens Maria
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2018
Field of study

International audienc

Crossref

MAnnheim DOCument Server

HAL Descartes

Publikationsserver der RWTH Aachen University

Hal-Diderot

E-commerce in Russia: Challenges and Opportunities : Russian e-commerce market for local and foreign enterpreneurs

Author: Samoilenko Anna
Publication venue: Metropolia Ammattikorkeakoulu
Publication date: 01/01/2016
Field of study

The work presented in this thesis explores Russian e-commerce as a rapidly growing and challenging industry quickly gaining popularity. The main focus will be on how e-retail business is progressing in Russia and which opportunities it offers as well as what kind of challenges it is bringing to local and foreign entrepreneurs willing to start e-commerce business in the country

Theseus

Linguistic influence patterns within the global network of Wikipedia language editions

Author: Edler Daniel
Karimi Fariba
Kunegis Jérôme
Samoilenko Anna
Strohmaier Markus
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2015
Field of study

MAnnheim DOCument Server

Analysing timelines of national histories across Wikipedia editions: A comparative computational approach

Author: Lemmerich Florian
Samoilenko Anna
Strohmaier Markus
Weller Katrin
Zens Maria
Publication venue: AAAI Press
Publication date: 01/01/2017
Field of study

arXiv.org e-Print Archive

MAnnheim DOCument Server

Publikationsserver der RWTH Aachen University

Association for the Advancement of Artificial Intelligence: AAAI Publications

Analyse rigoureuse de surfaces selectives en frequence

Author: Edler Daniel
Karimi Fariba
Kunegis Jerome
Samoilenko Anna
Strohmaier Markus
Publication venue
Publication date: 01/01/1991
Field of study

SIGLEAvailable from INIST (FR), Document Supply Service, under shelf-number : T 81666 / INIST-CNRS - Institut de l'Information Scientifique et TechniqueFRFranc

arXiv.org e-Print Archive

Publikationer från Umeå universitet

Springer - Publisher Connector

MAnnheim DOCument Server

Publikationsserver der RWTH Aachen University

Digitala Vetenskapliga Arkivet - Academic Archive On-line

OpenGrey Repository